SEQUENCE LISTING

(1) GENERAL INFORMATION

(i) APPLICANT: Coughlin, Shaun
Ishihari, Hiroaki
Connolly, Andrew

(ii) TITLE OF THE INVENTION: Protease Activated Receptor 3 and Uses Thereof

(iii) NUMBER OF SEQUENCES: 28

(iv) CORRESPONDENCE ADDRESS:

(A) ADDRESSEE: MORRISON & FOERSTER
(B) STREET: 755 Page Mill Road
(C) CITY: Palo Alto
(D) STATE: CA
(E) COUNTRY:
(F) ZIP: 94304-1018



(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE : FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 09/208,629 

(B) FILING DATE: 08-DEC-1998

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/742,440

(B) FILING DATE: 30-OCT-1996



(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Catherine M. Polizzi 

(B) REGISTRATION NUMBER: 40,130 

(C) REFERENCE/DOCKET NUMBER: 220002060310 



(ix) TELECOMMUNICATION INFORMATION:

(A) TELEPHONE: (650) 813-5600

(B) TELEFAX: (650) £4-0792

(C) TELEX:



(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1224 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:



TGACTTTGTA TACTTAACAA CATCCTGTAG CCGGGTCTCA GGACATCAAG ATGAAAATCC 60
TTATCTTGGT TGCAGCTGGG CTGCTGTTTC TGCCAGTCAC TGTTTGCCAA AGTGGCATAA 120
ATGTTTCAGA CAACTCAffiCA AAGCCAACCT TAACTATTAA GAGTTTTAAT GGGGGTCCCC 180
AAAATACCTT TGAAGAATCTC CCACTTTCTG ACATAGAGGG CTGGACAGGA GCCACCACAA 240
CTATAAAAGC GGAGTGTCCC GAGGACAGTA TTTCAACTCT CCACGTGAAT AATGCTACCA 300
TAGGATACCT GAGAAGTTGC TTAAGTACCC AAGTGATACC TGCCATCTAT ATCCTGCTGT 360
TTGTGGTTGG TGTACCATCl AACATCGTGA CCCTGTGGAA ACTCTCCTTA AGGACCAAAT 420
CCATCAGTCT GGTCATCTT' CACACCAACC TGGCCATCGC AGATCTCCTT TTCTGTGTCA 480
CACTGCCATT TAAGATCGCi TACCATCTCA ATGGCAACAA CTGGGTATTT GGCGAGGTCA 540
TGTGCCGGAT CACCACGGTC iGTTTTCTACG GCAACATGTA CTGCGCTATC CTGATCCTCA 600
CTTGCATGGG CATCAACCGC TACCTGGCCA CGGCTCACCC TTTCACATAC CAGAAGCTGC 660
CCAAACGCAG CTTCTCCTTG ?TCATGTGTG GCATAGTGTG GGTCATGGTT TTCTTATACA 720
TGCTGCCCTT TGTCATCCTG VGCAGGAGT ACCACCTCGT CCACTCAGAG ATCACCACCT 780
GCCACGATGT CGTCGACGCG TfeCGAGTCCC CATCATCCTT CCGATTCTAC TACTTCGTCT 840
CCTTAGCATT CTTTGGGTTC CECATCCCGT TTGTGATCAT CATCTTCTGT TACACGACTC 900
TCATCCACAA ACTTAAATCA AAGGATCGGA TATGGCTGGG CTACATCAAG GCCGTCCTCC 960
TCATCCTTGT GATTTTCACA ATTTGCTTTG CCCCCACCAA CATCATACTC GTAATCCACC 1020
ATGCCAACTA CTACTACCAC AAmCCGACA GCTTGTACTT TATGTATCTT ATTGCTCTGT 1080
GCCTGGGGAG CCTGAATAGC TGCGTAGATC CATTCCTTTA CTTTGTCATG TCGAAAGTTG 1140
TAGATCAGCT TAATCCTTAG TCGGfcAATGG CAAGACCACT TTAGAGACCA AGGAGAGATA 1200
TCTGGGAAGA CATACATGCT TGGC 1224



(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1124 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Other

(B) LOCATION: 1...1124
(D) OTHER INFORMATION: N=A or C or G or T at residues 22 and 27

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:



CCATATGCTA ATATTTCCTT TCAATTACAG GCATA&ATGT TTCAGACAAC TCAGCAAAGC 60
CAACCTTAAC TATTAAGAGT TTTAATGGGG GTCCC&AAAA TACCTTTGAA GAATTCNNNN 120
NNNTACAACT CTCCATGTGA ATAATGCTAC CATGGqATAC CTGAGAAGTT CCTTAAGTAC 180
CAAAGTGATA CCTGCCATCT ACATCCTGGT GTTTGT&ATT GGTGTACCAG CGAACATCGT 240
GACCCTGTGG AAACTCTCCT CAAGGACCAA ATCCATqCTGT CTGGTCATCT TTCACACCAA 300
CCTGGCCATC GCGGATCTCC TTTTCTGTGT CACGCTGECG TTTAAGATCN NCCTACCATC 360
TCAATGGCAA CAACTGGGTA TTTGGCGAGG TCATGTGQCG GATCACCACG GTCGTTTTCT 420
ACGGCAACAT GTACTGCGCT ANNNTCCTGA TCCTCACOfG CATGGGCATC AACCGCTACC 480
TGGCCACGGC TCACCCTTTC ACATACCAGA AGCTGCCC; ACGCAGCTTC TCCATGCTCA 540
TGTGTGGCAT GGTGTGGGTC ATGGTTTTCT TATACATGC^ GCCCTTTGTC ATCCNNNAAG 600
CAGGAGTACC ACCTCGTCCA CTCCGAGATC ACCACCTGCC ACGATGTCGT CGACGCGTGC 660
GANTCCCCAT CATCCTTCCG ATTCTACTAC TTCGTCTCC] TAGCATTCTT TGGGTTCCTC 720
ATCCCGTTTG TGATCATCAT CTTCTGTTAC ACGACTCTCA TCCACAAACT TAAATCAAAA 780
GATCNGATAT GGCTGGGCTA CATCAAGGCC GTCCTCCTCA 'CCTTGTGAA TTTCACCATC 840
TGCTTCCCCC CCACCAAGNN NNNNGATATC TGGGAAGACG 'ACATGCTTG GCTGACTTGT 900
GCATGGCACC ATCAGCTCAA TTTTTAATTT TTTAATTTTA TTTAATTTA ATTTTATGTT 960
TTTGAGACAG AGCCTCACTG TGTAGTCCTG GCTGGCCTGG CTGGTTCTCT ATTTAGACCA 1020
GGTTAGCCTT GAACTCACAG AGATCTGCCT GCTTCTGCCT CCCAAGTGCT GGGTTCAACC 1080
AGGTCTGGCA AGCGCTCCAT TTTTCAGCTC CTCTGCAACA GTGC 1124

(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 407 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:
Thr Leu Tyr Thr 

(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1224 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

TGCTCCATGA TTTTACAGAT TTCATAACGT TTAAGAGACG GGACTCAGGT CATCAAAATG 60 

AAAGCCCTCA TCTTTGCAGC TGCTGGCCTC CTGCTTCTGT TGCCCACTTT TTGTCAGAGT 120 

GGCATGGAAA ATGATACAAA CAACTTGGCA AAGCCAACCT TACCCATTAA GACCTTTCGT 180 

GGAGCTCCCC CAAATTCTTT TGAAGAGTTC CCCTTTTCTG CCTTGGAAGG CTGGACAGGA 240 

GCCACGATTA CTGTAAAAAT TAAGTGCCCT GAAGAAAGTG CTTCACATCT CCATGTGAAA 300

AATGCTACCA TGGGGTACCT GACCAGCTCC TTAAGTACTA AACTGATACC TGCCATCTAC 360 

CTCCTGGTGT TTGTAGTTGG TGTCCCGGCC AATGCTGTGA CCCTGTGGAT GCTTTTCTTC 420 

AGGACCAGAT CCATCTGTAC CACTGTATTC TACACCAACC TGGCCATTGC AGATTTTCTT 480 

TTTTGTGTTA CATTGCCCTT TAAGATAGCT TATCATCTCA ATGGGAACAA CTGGGTATTT 540

GGAGAGGTCC TGTGCCGGGC CACCACAGTC ATCTTCTATG GCAACATGTA CTGCTCCATT 600 

CTGCTCCTTG CCTGCATCAG CATCAACCGC TACCTGGCCA TCGTCCATCC TTTCACCTAC 660 

CGGGGCCTGC CCAAGCACAC CTATGCCTTG GTAACATGTG GACTGGTGTG GGCAACAGTT 720 

TTCTTATATA TGCTGCCATT TTTCATACTG AAGCAGGAAT ATTATCTTGT TCAGCCAGAC 780 

ATCACCACCT GCCATGATGT TCACAACACT TGCGAGTCCT CATCTCCCTT CCAACTCTAT 840 

TACTTCATCT CCTTGGCATT CTTTGGATTC TTAATTCCAT TTGTGCTTAT CATCTACTGC 900 

TATGCAGCCA TCATCCGGAC ACTTAATGCA TACGATCATA GATGGTTGTG GTATGTTAAG 960 

GCGAGTCTCC TCATCCTTGT GATTTTTACC ATTTGCTTTG CTCCAAGCAA TATTATTCTT 1020 

ATTATTCACC ATGCTAACTA CTACTACAAC AACACTGATG GCTTATATTT TATATATCTC 1080 

ATAGCTTTGT GCCTGGGTAG TCTTAATAGT TGCTTAGATC CATTCCTTTA TTTTCTCATG 1140 

TCAAAAACCA GAAATCACTC CACTGCTTAC CTTACAAAAT AGTGAAATGA TCTTAGAGAA 1200 

CAAGGACAGC CATCACAGAG AACG 1224 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1102 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 







(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

ACAGGCATGG AAAATGATAC AAACAACTTG GCAAAGCCAA CCTTACCCAT TAAGACCTTT 60 

CGTGGAGCTC CCCCAAATTC TTTTGAAGAG TTCCCCTTTT CTGCCTTGGA AGGCTGGACA 120 

GGAGCCACGA TTACTGTAAA AATTAAGTGC CCTGAAGAAA GTGCTTCACA TCTCCATGTG 180 

AAAAATGCTA CCATGGGGTA CCTGACCAGC TCCTTAAGTA CTAAACTGAT ACCTGCCATC 240 

TACCTCCTGG TGTTTGTAGT TGGTGTCCCG GCCAATGCTG TGACCCTGTG GATGCTTTTC 300 

TTCAGGACCA GATCCATCTG TACCACTGTA TTCTACACCA ACCTGGCCAT TGCAGATTTT 360 

CTTTTTTGTG TTACATTGCC CTTTAAGATA GCTTATCATC TCAATGGGAA CAACTGGGTA 420 

TTTGGAGAGG TCCTGTGCCG GGCCACCACA GTCATCTTCT ATGGCAACAT GTACTGCTCC 480

ATTCTGCTCC TTGCCTGCAT CAGCATCAAC CGCTACCTGG CCATCGTCCA TCCTTTCACC 540 

TACCGGGGCC TGCCCAAGCA CACCTATGCC TTGGTAACAT GTGGACTGGT GTGGGCAACA 600 

GTTTTCTTAT ATATGCTGCC ATTTTTCATA CTGAAGCAGG AATATTATCT TGTTCAGCCA 660 

GACATCACCA CCTGCCATGA TGTTCACAAC ACTTGCGAGT CCTCATCTCC CTTCCAACTC 720 

TATTACTTCA TCTCCTTGGC ATTCTTTGGA TTCTTAATTC CATTTGTGCT TATCATCTAC 780 

TGCTATGCAG CCATCATCCG GACACTTAAT GCATACGATC ATAGATGGTT GTGGTATGTT 840

AAGGCGAGTC TCCTCATCCT TGTGATTTTT ACCATTTGCT TTGCTCCAAG CAATATTATT 900 

CTTATTATTC ACCATGCTAA CTACTACTAC AACAACACTG ATGGCTTATA TTTTATATAT 960 

CTCATAGCTT TGTGCCTGGG TAGTCTTAAT AGTTGCTTAG ATCCATTCCT TTATTTTCTC 1020 

ATGTCAAAAA CCAGAAATCA CTCCACTGCT TACCTTACAA AATAGTGAAA TGATCTTAGA 1080 

GAACAAGGAC AGCCATCACA GA 1102 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 408 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 



Cys Ser Met Ile Leu Gln Ile Ser



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 425 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



Met Gly Pro Arg Arg Leu Leu Leu Val Ala Ala Cys Phe Ser Leu Cys
1 5 10 15

Gly Pro Leu Leu Ser Ala Arg Thr Arg Ala Arg Arg Pro Glu Ser Lys
20 25 30

Ala Thr Asn Ala Thr Leu Asp Pro Arg Ser Phe Leu Leu Arg Asn Pro
35 40 45

Asn Asp Lys Tyr Glu Pro Phe Trp Glu Asp Glu Glu Lys Asn Glu Ser
50 55 60

Gly Leu Thr Glu Tyr Arg Leu Val Ser Ile Asn Lys Ser Ser Pro Leu
65 70 75 80

Gln Lys Gln Leu Pro Ala Phe Ile Ser Glu Asp Ala Ser Gly Tyr Leu
85 90 95

Thr Ser Ser Trp Leu Thr Leu Phe Val Pro Ser Val Tyr Thr Gly Val
100 105 110

Phe Val Val Ser Leu Pro Leu Asn Ile Met Ala Ile Val Val Phe Ile
115 120 125

Leu Lys Met Lys Val Lys Lys Pro Ala Val Val Tyr Met Leu His Leu
130 135 140

Ala Thr Ala Asp Val Leu Phe Val Ser Val Leu Pro Phe Lys Ile Ser
145 150 155 160

Tyr Tyr Phe Ser Gly Ser Asp Trp Gln Phe Gly Ser Glu Leu Cys Arg
165 170 175

Phe Val Thr Ala Ala Phe Tyr Cys Asn Met Tyr Ala Ser Ile Leu Leu
180 185 190

Met Thr Val Ile Ser Ile Asp Arg Phe Leu Ala Val Val Tyr Pro Met
195 200 205

Gln Ser Leu Ser Trp Arg Thr Leu Gly Arg Ala Ser Phe Thr Cys Leu
210 215 220

Ala Ile Trp Ala Leu Ala Ile Ala Gly Val Val Pro Leu Val Leu Lys
225 230 235 240

Glu Gln Thr Ile Gln Val Pro Gly Leu Asn Ile Thr Thr Cys His Asp
245 250 255

Val Leu Asn Glu Thr Leu Leu Glu Gly Tyr Tyr Ala Tyr Tyr Phe Ser
260 265 270

Ala Phe Ser Ala Val Phe Phe Phe Val Pro Leu Ile Ile Ser Thr Val
275 280 285

Cys Tyr Val Ser Ile Ile Arg Cys Leu Ser Ser Ser Ala Val Ala Asn
290 295 300

Arg Ser Lys Lys Ser Arg Ala Leu Phe Leu Ser Ala Ala Val Phe Cys
305 310 315 320

Ile Phe Ile Ile Cys Phe Gly Pro Thr Asn Val Leu Leu Ile Ala His
325 330 335

Tyr Ser Phe Leu Ser His Thr Ser Thr Thr Glu Ala Ala Tyr Phe Ala
340 345 350

Tyr Leu Leu Cys Val Cys Val Ser Ser Ile Ser Ser Cys Ile Asp Pro
355 360 365

Leu Ile Tyr Tyr Tyr Ala Ser Ser Glu Cys Gln Arg Tyr Val Tyr Ser
370 375 380

Ile Leu Cys Cys Lys Glu Ser Ser Asp Pro Ser Ser Tyr Asn Ser Ser
385 390 395 400

Gly Gln Leu Met Ala Ser Lys Met Asp Thr Cys Ser Ser Asn Leu Asn
405 410 415

Asn Ser Ile Tyr Lys Lys Leu Leu Thr
420 425



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 394 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Met Arg Ser Pro Ser Ala Ala Trp Leu Leu Gly Ala Ala Ile Leu Leu
1 5 10 15

Ala Ala Ser Leu Ser Cys Ser Gly Thr Ile Gln Gly Thr Asn Arg Ser
20 25 30

Ser Lys Gly Arg Ser Leu Ile Gly Lys Val Asp Gly Thr Ser His Val
35 40 45

Thr Gly Lys Gly Val Thr Val Glu Thr Val Phe Ser Val Asp Glu Phe
50 55 60

Ser Ala Ser Val Leu Thr Gly Lys Leu Thr Thr Val Phe Leu Pro Ile
65 70 75 80

Val Tyr Thr Ile Val Phe Val Val Gly Leu Pro Ser Asn Gly Met Ala
85 90 95

Leu Trp Val Phe Leu Phe Arg Thr Lys Lys Lys His Pro Ala Val Ile
100 105 110

Tyr Met Ala Asn Leu Ala Leu Ala Asp Leu Leu Ser Val Ile Trp Phe
115 120 125

Pro Leu Lys Ile Ala Tyr His Ile His Gly Asn Asn Trp Ile Tyr Gly
130 135 140

Glu Ala Leu Cys Asn Val Leu Ile Gly Phe Phe Tyr Gly Asn Met Tyr
145 150 155 160

Cys Ser Ile Leu Phe Met Thr Cys Leu Ser Val Gln Arg Tyr Trp Val
165 170 175

Ile Val Asn Pro Met Gly His Ser Arg Lys Lys Ala Asn Ile Ala Ile
180 185 190

Gly Ile Ser Leu Ala Ile Trp Leu Leu Ile Leu Leu Val Thr Ile Pro
195 200 205

Leu Tyr Val Val Lys Gln Thr Ile Phe Ile Pro Ala Leu Asn Ile Thr
210 215 220

Thr Cys His Asp Val Leu Pro Glu Gln Leu Leu Val Gly Asp Pro Phe
225 230 235 240

Leu Ser Leu Ala Ile Gly Val Phe Leu Phe Pro Ala Phe Leu Thr Ala
245 250 255

Ser Ala Tyr Val Leu Met Ile Arg Met Leu Arg Ser Ser Ala Met Asp
260 265 270

Glu Asn Ser Glu Lys Lys Arg Lys Arg Ala Ile Lys Leu Ile Val Thr
275 280 285

Val Leu Ala Met Tyr Leu Ile Cys Phe Thr Pro Ser Asn Leu Leu Leu
290 295 300

Val Val His Tyr Phe Leu Ile Lys Ser Gln Gly Gln Ser His Val Tyr
305 310 315 320

Ala Leu Tyr Ile Val Ala Leu Cys Leu Ser Thr Leu Asn Ser Cys Ile
325 330 335

Asp Pro Phe Val Tyr Tyr Phe Val Ser His Asp Phe Arg Asp His Ala
340 345 350

Lys Asn Ala Leu Leu Cys Arg Ser Val Arg Thr Val Lys Gln Met Gln
355 360 365

Val Ser Leu Thr Ser Lys Lys His Ser Arg Lys Ser Ser Ser Tyr Ser
370 375 380

Ser Ser Ser Thr Thr Val Lys Thr Ser Tyr
385 390



(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

Asp Phe Glu Glu Ile Pro Glu Glu Tyr Leu Gln
1 5 10

(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME /KEY : Other 

(B) LOCATION: 1. . .29 

(D) OTHER INFORMATION: N=Inosine at residues 3, 12, 18, 21, and 24 



(A) NAME /KEY : Other 

(B) LOCATION: 22 . . .27 

(D) OTHER INFORMATION: N=A or C or G or T at residues 22 and 27 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GTNTACATGC TNMACYTNGC NNTNGCNGA 29 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA
(ix) FEATURE: 

(A) NAME/KEY: Other 

(B) LOCATION: 6 . . .21 

(D) OTHER INFORMATION: N=Inosine at residue 6, 9, 12, 15, and 21 

(ix) FEATURE: 

(A) NAME /KEY : Other 

(B) LOCATION: 24 

(D) OTHER INFORMATION: N=A or C or G or T at residue 24 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GGATANACNA CNGCNADRWA NCKNTC 26 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
Asp Tyr Lys Asp Asp Asp
1 5



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 39 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Asp Ser Lys Gly Ser Ser Gln Lys Gly Ser Arg Leu Leu Leu Leu
1 5 10 15

Leu Val Val Ser Asn Leu Leu Leu Cys Gln Gly Val Val Ser Asp Tyr
20 25 30

Lys Asp Asp Asp Asp Val Glu
35

(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:

Phe Glu Glu Phe Pro
1 5

(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:

Leu Thr Pro Lys
1

(2) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

Thr Phe Arg Gly Ala Pro
1 5



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Thr Phe Arg Gly Ala Pro Pro Asn Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Leu Pro Ile Lys Thr Phe Arg Gly Ala Pro Pro Asn Ser Phe Glu Glu

1 5 10 15

Phe Pro Phe Ser Ala Leu Glu 
20 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:
Leu Pro Ile Lys Pro Phe Arg Gly Ala Pro Pro Asn Ser Phe Glu Glu
1 5 10 15

Phe Pro Phe Ser Ala Leu Glu
20

(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Leu Pro Ile Xaa Thr Pro Phe Arg Gly Ala Pro Pro Asn Ser Phe Glu

1 5 10 15

Glu Phe Pro Phe Ser Ala Leu Glu 
20 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Xaa Pro Arg Pro Phe Arg Gly Ala Pro Pro Asn Ser Phe Glu Glu Phe 

1 5 10 15

Pro Phe Ser Ala Leu Glu 
20 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Leu Pro Ile Lys
1 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Thr Phe Arg Gly Ala Pro Pro Asn Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 380 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



Gln His Pro Val Ala Gly Ser Gln Asp Ile Lys Met Lys Ile Leu Ile
1 5 10 15

Leu Val Ala Ala Gly Leu Leu Phe Leu Pro Val Thr Val Cys Gln Ser
20 25 30

Gly Ile Asn Val Ser Asp Asn Ser Ala Lys Pro Thr Leu Thr Ile Lys
35 40 45

Ser Phe Asn Gly Gly Pro Gln Asn Thr Phe Glu Glu Phe Pro Leu Ser
50 55 60

Asp Ile Glu Gly Trp Thr Gly Ala Thr Thr Thr Ile Lys Ala Glu Cys
65 70 75 80

Pro Glu Asp Ser Ile Ser Thr Leu His Val Asn Asn Ala Thr Ile Gly
85 90 95

Tyr Leu Arg Ser Ser Leu Ser Thr Gln Val Ile Pro Ala Ile Tyr Ile
100 105 110

Leu Leu Phe Val Val Gly Val Pro Ser Asn Ile Val Thr Leu Trp Lys
115 120 125

Leu Ser Leu Arg Thr Lys Ser Ile Ser Leu Val Ile Phe His Thr Asn
130 135 140

Leu Ala Ile Ala Asp Leu Leu Phe Cys Val Thr Leu Pro Phe Lys Ile
145 150 155 160

Ala Tyr His Leu Asn Gly Asn Asn Trp Val Phe Gly Glu Val Met Cys
165 170 175

Arg Ile Thr Thr Val Val Phe Tyr Gly Asn Met Tyr Cys Ala Ile Leu
180 185 190

Ile Leu Thr Cys Met Gly Ile Asn Arg Tyr Leu Ala Thr Ala His Pro
195 200 205

Phe Thr Tyr Gln Lys Leu Pro Lys Arg Ser Phe Ser Leu Leu Met Cys
210 215 220

Gly Ile Val Trp Val Met Val Phe Leu Tyr Met Leu Pro Phe Val Ile
225 230 235 240

Leu Lys Gln Glu Tyr His Leu Val His Ser Glu Ile Thr Thr Cys His
245 250 255

Asp Val Val Asp Ala Cys Glu Ser Pro Ser Ser Phe Arg Phe Tyr Tyr
260 265 270

Phe Val Ser Leu Ala Phe Phe Gly Phe Leu Ile Pro Phe Val Ile Ile
275 280 285

Ile Phe Cys Tyr Thr Thr Leu Ile His Lys Leu Lys Ser Lys Asp Arg
290 295 300

Ile Trp Leu Gly Tyr Ile Lys Ala Val Leu Leu Ile Leu Val Ile Phe
305 310 315 320

Thr Ile Cys Phe Ala Pro Thr Asn Ile Ile Leu Val Ile His His Ala
325 330 335

Asn Tyr Tyr Tyr His Asn Thr Asp Ser Leu Tyr Phe Met Tyr Leu Ile
340 345 350

Ala Leu Cys Leu Gly Ser Leu Asn Ser Cys Leu Asp Pro Phe Leu Tyr
355 360 365

Phe Val Met Ser Lys Val Val Asp Gln Leu Asn Pro
370 375 380











(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

Ser Ala Met Ala Arg Pro Leu
1 5



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

Arg Pro Arg Arg Asp Ile Trp Glu Asp Ile His Ala Trp
1 5 10

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 393 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



Cys Ser Met Ile Leu Gln Ile Ser Xaa Arg Leu Arg Asp Gly Thr Gln
1 5 10 15

Val Ile Lys Met Lys Ala Leu Ile Phe Ala Ala Ala Gly Leu Leu Leu
20 25 30

Leu Leu Pro Thr Phe Cys Gln Ser Gly Met Glu Asn Asp Thr Asn Asn
35 40 45

Leu Ala Lys Pro Thr Leu Pro Ile Lys Thr Phe Arg Gly Ala Pro Pro
50 55 60

Asn Ser Phe Glu Glu Phe Pro Phe Ser Ala Leu Glu Gly Trp Thr Gly
65 70 75 80

Ala Thr Ile Thr Val Lys Ile Lys Cys Pro Glu Glu Ser Ala Ser His
85 90 95

Leu His Val Lys Asn Ala Thr Met Gly Tyr Leu Thr Ser Ser Leu Ser
100 105 110

Thr Lys Leu Ile Pro Ala Ile Tyr Leu Leu Val Phe Val Val Gly Val
115 120 125

Pro Ala Asn Ala Val Thr Leu Trp Met Leu Phe Phe Arg Thr Arg Ser
130 135 140

Ile Cys Thr Thr Val Phe Tyr Thr Asn Leu Ala Ile Ala Asp Phe Leu
145 150 155 160

Phe Cys Val Thr Leu Pro Phe Lys Ile Ala Tyr His Leu Asn Gly Asn
165 170 175

Asn Trp Val Phe Gly Glu Val Leu Cys Arg Ala Thr Thr Val Ile Phe
180 185 190

Tyr Gly Asn Met Tyr Cys Ser Ile Leu Leu Leu Ala Cys Ile Ser Ile
195 200 205

Asn Arg Tyr Leu Ala Ile Val His Pro Phe Thr Tyr Arg Gly Leu Pro
210 215 220

Lys His Thr Tyr Ala Leu Val Thr Cys Gly Leu Val Trp Ala Thr Val
225 230 235 240

Phe Leu Tyr Met Leu Pro Phe Phe Ile Leu Lys Gln Glu Tyr Tyr Leu
245 250 255

Val Gln Pro Asp Ile Thr Thr Cys His Asp Val His Asn Thr Cys Glu
260 265 270

Ser Ser Ser Pro Phe Gln Leu Tyr Tyr Phe Ile Ser Leu Ala Phe Phe
275 280 285

Gly Phe Leu Ile Pro Phe Val Leu Ile Ile Tyr Cys Tyr Ala Ala Ile
290 295 300

Ile Arg Thr Leu Asn Ala Tyr Asp His Arg Trp Leu Trp Tyr Val Lys
305 310 315 320

Ala Ser Leu Leu Ile Leu Val Ile Phe Thr Ile Cys Phe Ala Pro Ser
325 330 335

Asn Ile Ile Leu Ile Ile His His Ala Asn Tyr Tyr Tyr Asn Asn Thr
340 345 350

Asp Gly Leu Tyr Phe Ile Tyr Leu Ile Ala Leu Cys Leu Gly Ser Leu
355 360 365

Asn Ser Cys Leu Asp Pro Phe Leu Tyr Phe Leu Met Ser Lys Thr Arg
370 375 380

Asn His Ser Thr Ala Tyr Leu Thr Lys
385 390

(2) INFORMATION FOR SEQ ID NO: 28:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



Asn Asp Leu Arg Glu Gln Gly Gln Pro Ser Gln Arg Thr
1 5 10